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Abstract. Given two graphs Gi and G2 on n vertices each, we define 
a graph G on vertex set Vi x V2 and the edge set as the union of edges 
of Gi x G 2 , Gi x G 2 , {(v,u'),(v,u")){\u',u" G V 2 } for each v £ Vx, 
and {((it', v), (u" , v))\u' , u" £ Vi} for each v £ Vi- We consider the 
completely-positive Lovasz ■& function, i.e., cp-d function for G. We show 
that the function evaluates to n whenever Gi and G2 are isomorphic 
and to less than n — l/(4n 4 ) when non-isomorphic. Hence this function 
provides a test for graph isomorphism. We also provide some geometric 
insight into the feasible region of the completely positive program. 



1 Introduction 

Let G\ = (V, E\),G-z = (V, E 2 ) be two simple undirected graphs, where V is the 
set of vertices of cardinality n and Ei,E 2 are the respective sets of edges. G\ 
and G2 are called isomorphic if there exists a bijection a : V — > V such that 
(a(x),a(y)) £ E 2 if and only if (x,y) £ E\. The graph isomorphism problem 
(GI) is the problem of determining if G\ and G 2 are isomorphic. This problem 
although clearly in class NP, has not been known to be either in P or NP- 
Complete [T^] , except for certain graphs where it is known to have polynomial 
complexity |4l7UOI14ll9l22j . 

There has been evidence suggesting that GI is not likely to be NP-Complete. 
One of them being that its counting version is reducible to its decision version 
[20 . Moreover, if the problem were NP-Complete, then the polynomial time 
hierarchy would collapse to its second level [318125] . A lot of research has there- 
fore gone into determining the largest complexity class for which it can be shown 
that GI is hard [15:26 . The largest complexity class known to be reducible to GI 
is DET [26]. The complexity aspects of GI are treated in much detail in [2117] . 

Apart from the obvious theoretical importance of determining its computa- 
tional complexity, the graph isomorphism problem finds such diverse applications 
as chemical identification |16j . scene analysis [1] and construction and enumera- 
tion of combinatorial configurations [5]. 

Several approaches to solve GI in polynomial time have been adopted. Among 
them is an approach to incrementally build an isomorphism between the graphs, 
[24] . Another approach has been to find a canonical labeling of the vertices of 
the two graphs, |6I5I21) . A comprehensive list of all the approaches is difficult 



to present here. There are some survey papers on the work published on this 
problem, such as [TT] . 

It was conjectured in |23j that the graph isomorphism problem can be reduced 
to a semidcfinitc feasibility problem. We make an attempt in this direction. 

Given two graphs G\ and Gi on n vertices each, we consider the Lovasz 
d function, for an n 2 vertex graph based on the two input graphs, with posi- 
tive semidefinite condition replaced by completely positivity condition. We show 
that if the graphs are isomorphic then the function evaluates to n and if non- 
isomorphic then it evaluates to a value less than n— 1/ (4n 4 ). Hence this provides 
a test for GI. 

2 Preliminaries 

2.1 Positive Semidefinite Matrices 

An m x m symmetric matrix M is said to be positive semidefinite if it can 
be expressed as Q ■ Q T for some m x k matrix Q. If the row vectors of Q are 
v\ , . . . , v m , then we will call this set a vector-realization of M in /c-dimensional 
space. We will denote the corresponding matrix M by M.(vi, . . . ,v m ). It is easy 
to see that there is always a vector realization in k = rank(M) dimensional 
space. 

If all entries of a positive semidefinite matrix M are non-negative, then it 
is called a doubly-non-negative (DN) matrix. Further, if M has a vector real- 
ization in which each component of each vector is non- negative (i.e., M has a 
decomposition Q ■ Q T where each entry of Q is non- negative) , then M is called a 
completely positive (CP) matrix. We will call it a non-negative vector realization 
of the CP matrix M. Every principal submatrix of a DN (resp. CP) matrix is 
DN (resp. CP). It is easy to see that every CP matrix is a DN matrix. 

It is not necessary that every decomposition Q ■ Q T of a CP matrix has all 
non-negative entries in Q. The smallest k, for which such an m x k matrix exists, 
is called the cp-rank of M. 

Theorem 1 ( |13| ). For any CP matrix M ofrankr, cp-rank(M) < r(r+l)/2. 

A geometrical view of a CP matrix is that if v\ , . . . , v m is a non- negative 
vector realization of it, then these vectors belong to the closed positive orthant 
('closed' in the sense of a polyhedron) of some orthogonal basis of the space. 

2.2 United Vectors 

Let w be any fixed unit vector. Then for every unit vector v, we call u = (w+v) jl 
a united vector with respect to w. We will drop the reference to w when it is 
unambiguous. 

Observation 2 With respect to a fixed unit vector w, 
(i) a vector u is united if and only if u • w = u 2 , 



(ii) if U\ and Ui are mutually orthogonal united vectors, then u\ + U2 is also a 
united vector. 

(Hi) let ui,...,Uk be a set of pairwise orthogonal united vectors. This set is 
maximal (i.e., no new united vector can be added to it while preserving pairwise 
orthogonality) if and only if w belongs to the subspace spanned by these vectors 
if and only if 2; Ui — w. 

(iv) for any collection of pairwise orthogonal united vectors u\, . . . ,Uj, W-Y^- Ui = 
X)j u i — 1- Further, u 2 — 1 if and only if the set is maximal, i.e., Ui = w. 

Lemma 1. Let u±, . . . ,Uk be pairwise orthogonal united vectors. Then matrices 
M(ui, . . . , ttfe) and M(ui, . . . ,Uk,w) are CP. 

Proof. Suppose ui-w = a t for i — 1, . . . , fc. Then there exists a coordinate system 
in (fc + l)-dimensional space in which u\ = (y/a±, 0, . . . , 0), U2 = (0, yja~2, 0, . . . , 0) 
so on, and w — {*J~a~i, ^fo^, ■ ■ ■ > y/a~k, \/l — X)i °«)- Since all entries are non- 
negative reals, the claim is established. □ 

3 Lovasz Theta Function [H] 

Given two graphs, each on n vertices, G\ — ([n], E\) and G2 — ([n], E-i). consider 
the semidefinite program SDP-LT given below, which computes a ^-function. 
The variable matrix Y is of size (n 2 + 1) x (n 2 + 1) with index set {ij\i,j € 
[n]}U{«}. 

SDP-LT: maximize 

i,je[n] 

subject to Y t (1) 



^ij 7 kl 


> 


, 1 < i,j, k,l < n 


(2) 


y 


= 1 




(3) 


Y 




, l<i,j<n 


(4) 




= 


,l<i,j,k<n,j^k 


(5) 


^ji,ki 


= 


, 1 < i,j,k < n, j ^ k 


(6) 


^ij 7 kl 


= 


, (i,fc)6£ lf (j,l)^E 2 


(7) 


^ij 7 kl 


= 


, {i,k)iE u (j,l)EE 2 


(8) 



Let Y be a solution of SDP-LT and let {uij\i,j 6 [n]} U {w} be a vector 
realization of Y. Then from conditions (3) and (4) every u%j is a united vector 
with respect to the unit vector w. Conditions (1) and (2) ensure that Y is DN. 

Every solution matrix Y of SDP-LT is (n 2 + 1) x (n 2 + 1) in size in which the 
last row and the last column are same as the diagonal. Hence from here onwards 
we will drop the last row and the last column and assume that Y is an n 2 x n 2 
matrix. 



Let {uij\i, j <G [7i]}U{u>} be a vector realization of any solution Y of SDP-LT. 



Consider the matrix 
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/ Un U12 ... Ui n \ 
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Each row and each column of this matrix is a set of pairwise orthogonal united 
vectors with respect to w. Hence from Observation^iv) the value of the objective 
function of SDP-LT, u 2 j: is at most n. 

Remark: The graph for which SDP-LT is a Lovasz d function, has a clique 
cover of size n (the edges of condition (5)). Hence this also establishes that the 
function value is bounded above by n. 



4 The Case of Isomorphic Graphs 

Let us continue to assume that {uij\i,j € [n]} U {w} is a vector realization of 
an arbitrary solution Y of SDP-LT. Suppose there exists any set of n vectors 
{ui 1 j 1 , . . • , Ui n j n } in which each pair has positive inner product. We will call such 
a set a complete consistent set. Observe that both i\, . . . , i n and ji, . . . ,j n are 
permutations of 1, 2, ... ,n. Hence we can rearrange them as {mio-(i), M2o-(2), ■ • • , 
u n cr(n)}- It is easy to see that in this case a is an isomorphism between G\ and 
G 2 . 

Consider the case when a is an isomorphism between G\ and G2. Consider 
a special solution of SDP-LT for this case: Let wo be some constant unit vector. 
Define w = wq and u ia ^ = w for all i and Uij — whenever j cr(i). In this 

case the matrix Y is Pa defined below. 

Definition 1. For any permutation a € S n (the symmetric group), the n 2 x 
n 2 matrix Pj 2 ' is defined by [P^]ij.u = [Pa]ij ■ [P<j]m, where P a denotes the 

\2] 

permutation matrix of a. The convex hull of{PP\ tr is a G\ , G2 isomorphism] 
will be denoted by Vg 1 ,g 2 - 

[21 

Observation 3 By construction P<j matrices are rank-1 positive semidefinite 
matrices. Since all entries of P a are non-negative, P& are CP. 

\2] [21 

If Pa is a solution of SDP-LT, then a is an isomorphism because (P a )ia-(i),ja-(j) 
= 1 for all Above discussion leads to the following lemma. 

[21 

Lemma 2. P a is a CP solution of SDP-LT if and only if a is an isomorphism 
between G\ andG2- 

The value of the objective function for Y = P a is ^^[Pa ]ia(i),i<r(i) = n - 
From the last statement of the previous section we have the following result. 



Lemma 3. The value of the objective function of SDP-LT is less than or equal 
to n. It reaches its maximum value n when G\ and Gi are isomorphic. 

[21 

Lemma 4. Let Y = ^2 aeI a a P a is a solution of SDP-LT, where a a > for 
each a £ I and ^2 aeI a a = 1. Then pj 2 ' is a solution of SDP-LT for each a € I. 

Proof. When P] 2 ' is extended to (n 2 + 1) x (n 2 + 1), then conditions (3) and 
(4) of SDP-LT are trivially satisfied. The extended matrix is equal to Q ■ Q T 
where (n 2 + 1) x 1 matrix Q has first n 2 entries same as those of P„ (i.e., first n 2 
entries is the vectorized P CT ) and the last entry is 1. Hence it satisfies conditions 
(1) and (2). The last four conditions of the SDP are satisfied by the extended 

[21 [21 

Pa because every zero condition satisfied by Y is also satisfied by P„ . Thus 

[21 

P a is a solution for every <j G /. □ 

From now on we consider SDP-LT with conditions (1) and (2) replaced with 
the condition that Y £ C* where C* is the cone of Completely Positive matrices. 
Let us call the modified program CP-LT and denote the function by cp'Q. 

Now we present the main result of this section. 

Lemma 5. The cpd function value is n if and only if G\ and Gi are isomorphic. 
Moreover, in this case the feasible region of CP-LT is equal to / Pg 1 G 2 - 

Proof. Consider a non-negative vector realization {uij\i,j G [n]} U {w} for a 
CP solution Y and the corresponding matrix W defined towards the end of 
Section 3. Since objective function attains value n, from Observation [5] vec- 
tors of each row/column form a maximal set of pairwise orthogonal united 
vectors. Also from the same Observation each row and each column adds up 
to w. Assume that the vector realization is in an ^-dimensional space. Con- 
sider the r-th component of the matrix, i.e., the matrix formed by the r-th 
component of each vector. Let us denote it by D r . Each element of D r is 
non- negative and each row and each column adds up to w r , the r-th compo- 
nent of w. Hence D r is w r times a doubly-stochastic matrix. But the vectors 
of the same row (resp. column) are orthogonal so exactly one entry is non- 
zero in each row (resp. column) if w r > 0. So D r — w r P crr for some per- 
mutation a r . We can express W by ^2 r w r P <Tr e r where e r denotes the unit 
vector along the r-th axis. Y^i is the inner product of the vectors Uy and 

U k l which is (E r w r(^ r )ije r ) ■ (Y,s w s(Pa s )kie s ) = Y^,r W r( P Tr)ij)( P Tr)kl) = 
J2r W r( F v})ij,kl- TllUS Y = 2 r «^P]y • Since J2r W r = W 2 = I, Y IS & COlwex 

combination of some of the Pj 2 '. From Lemmas |4] and [2] each ay, with w r > 0, 
is an isomorphism between G\ and G*2. Since w is a unit vector, w r > for at 
least one r. Hence G\ and Gi are isomorphic. Conversely from Lemma [2] if a is 

[21 

an isomorphism, then Pi- is a solution and its objective function value is n. 

When the cpd function has value n, the above discussion implies that the 
feasible region is contained in PgiG 2 - Conversely, from Lemma [2] and the fact 
that convex combination of CP solutions is also a CP solution, we deduce that 
Vg 1 g 2 is contained in the feasible region. □ 



5 The Case of Non-Isomorphic Graphs 



Lemma 6. Let Y be a solution of CP-LT. IfJ2j — 1 — l/(4n 4 ) for each 
i, then G\ and Gi are isomorphic. 

Proof. Let N denote the cp-rank of Y. So we have a non- negative vector re- 
alization of Y, {v,ij\i,j G [n]} U {w} in an .ZV-dimensional space. So we have 
Uij ■ uu — Yij^i for all i,j,k,l and there is an orthonormal basis of this space, 
B = {e p \p G [N]}, such that every Uij belongs to the closed positive orthant of 
this basis. 

From Theorem [T] N < n A because the rank of Y is at most n 2 . Hence from 
the statement of this lemma w ■ J2j u ij = J2j u % = Ylij => 1 — 1/(4-^0 for 
each i. Let Si = {uu,... , itm} for each i. Since it is a set of orthogonal united 
vectors with respect to w, w. iiy = . it^- < 1. 

Without loss of generality assume that w-ex > w-ej for all j. Sow-ei > 1/yN 
because w is a unit vector. If every vector in Si is perpendicular to e\, then 
w ■ can be at most \w — (w ■ ei)ei|, which is at most (1 — (w ■ ei) 2 ) 1 / 2 < 

(1 - 1/iV) 1 / 2 < 1 - l/(2N), contrary to the given fact. So there exists a vector 
Uij i G Si such that Uij i ■ e\ > 0. Let there be a k ^ ji such that Uik ■ &i > 0. As 
all vectors are in the closed positive orthant, Uij i ■ u^ > (v,ij i ■ e±)(uik ■ e\) > 0. 
This contradicts the fact that the vectors of Sj are pairwise orthogonal. Hence 
we conclude that for each i there exists a unique vector Uij i G Si such that 
• ei > 0. Thus J2]=i u v ' e i = u Vi ' e i- 

Next we will show that Uij t ■ Ukj k > l/(16iV 2 ) for all i,k G [n]. For any i, 
from the given facts 1 — l/(4iV) < w ■ J2j u ij — ( w ' e i)(J2j u ij ' e i) + ( w ~ ( w ' 
e l) e l)-(Y^j u ij-(Y,j u ij- e l) e l)- Since (Y,j u ij) 2 < i> (^-(w-ei^iHX^Wi,-- 
(Sj u y ' e i) e i) < K -10 — ( w ' e i) e i)l < 1 — V(2-W)< The last inequality has been 
established in the previous paragraph. So (u> • ei)(^ • ei) > 1/(4N) for all 
i. Hence • e\ = J2j u ij ' e i ^ 1/(4^)- 

All vectors of each Si are in the closed positive orthant hence • Ukj k > 
(«i 3l • ei)(ukj k ■ ei) > l/(16iV 2 ) for all i, fc G [n]. Thus the set {uij x , . . . ,u n j n } 
is pairwise non-orthogonal, and hence a complete consistent set. From the first 
paragraph of Section 4 we know that the permutation, a(i) — ji for all i, is an 
isomorphism between G\ and Gi. O 

Corollary 1. If Gi and Gi are non-isomorphic, then the value of the cp§- 
function of CP-LT must be less than n — l/(4n 4 ). 

6 Conclusion 

We have seen that if G\ and Gi are isomorphic, then the cpfi function value is 
n. If the graphs are not isomorphic, then the function value remains less than 
n — l/(4n 4 ). Moreover in isomorphic case the feasible region is exactly Vg 1 g 2 - 
Since a completely positive program cannot be solved in polynomial time, the 
above is not a polynomial time test. We have the following theorem. 



Theorem 4. The proposed completely positive d function takes value n when 
the graphs are isomorphic and it takes value less than n — l/(4n 4 ) when the 
graphs are non-isomorphic. This gives a test for graph isomorphism. 
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